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ABSTRACT 

We use the historic record of sunspot groups compiled by the Royal Greenwich Observatory together with the sunspot number to 
derive the statistical properties of sunspot group emergence in dependence of cycle phase and strength. In particular we discuss the 
latitude, longitude, area and tilt angle of sunspot groups as functions of the cycle strength and of time during the solar cycle. 
Using these empirical characteristics the time-latitude diagram of sunspot group emergence (butterfly diagram) is reconstructed from 
1700 onward on the basis of the Wolf and group sunspot numbers. This reconstruction will be useful in studies of the long-term 
evolution of the Sun's magnetic field. 
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1. Introduction 

The synoptic record of sunspot emergence is an important in- 
put into, for example, long t erm re constructions of the solar 
open flux (e.g. Cameron et an,l2010l) . solar irradiance variations 
(e.g. ICrouch et al.L 120081) and is rele vant for understandin g the 
solar dynamo (for a recent review see ICharbonneaul 1201 Ob . The 
quantities which are used in these types of studies include the 
sunspot areas, emergence latitudes and longitudes as well as the 
tilt angle between sunspots of opposite p olarities within a group . 
As an example of such a study, in ICameron" etajJ hoifll) 



we used the observed Royal Greenwich ObservatoryU (RGO) 
records of sunspot areas, longitudes, latitudes and the Mount 
Wilson Observatory and Kodaikanal recor ds of tilt angles 
(Howard et al., 1984, 1999; Sivaraman et^aP. Il993h as t he in- 
put to a surface flux transport model (e . g. iDevore et al.l 19851: 
ISheelev et al.l 119851: IWang et al.L Il989t iBaumann et airi2004 . 
The results from the model was the large-scale evolution of the 
surface magnetic field, which was extrapolated into interplane- 
tary space using a current shee t sourc e surface (CSSS) extrapo- 
lation (e.g. IZhao & HoeksemaLlT995l) . The open field calculated 
from the model was then compared to that inferred from obser- 
vations of the geomagnetic aa index. The time period analyzed 
was restricted by the fact that the RGO dataset only extends back 
to 1874 and the MWO and Kodaikanal tilt angle datasets are 
even shorter 

The purpose of this paper is to construct partially synthetic 
datasets of sunspot emergence coveting the period from 1700 to 
2010. In the second paper in this series we intend to use these 
semi-synthetic records with the surface flux transport model and 
CSSS extrapolation. The semi-synthetic data sets however have 
a much wider application, for example in irradiance studies and 
in understanding the solar dynamo. We therefore here present 
the analysis and methods for creating them. 



The time dependence of the emergence is taken from the 
sunspot number data, either the monthly group sunspot number, 
Rc (Hovt & Schatten, 1998), or the monthly Wolf sunspot num- 
ber, Rz dWolfl 1 186 lb . Correlations between the strength of the 
cycle, derived from Rq or Rz, and the areas, emergence latitudes 
and longitudes, and tilt angles of sunspot groups are sought. For 
this purpose we use the Royal Greenwich Observatory (RGO) 
record of sunspot group areas, latitudes, longitudes as well as 
the MWO and Kodaikanal recor ds of sunspot group tilt an- 



gles. Unlike previous studies (e .g. Li et al.L 20031; Solanki et al.L 



120081: iDasi-Espuig et all |2010L llvanov & MiletskvL 1201 Ih . we 



' All data was obtained from the NOAA website 
|http://www.ngdc.noaa.gov/stp/ solar/sola rdataservices.html| 



consider correlations of many of the emergence properties with 
cycle properties derived from the monthly sunspot number. 



These correlations are then used in conjunction with the Rq 
and Rz records to construct artificial sunspot group data extend- 
ing back to 1700. As the time dependence is taken from observa- 
tions and the other properties of the sunspot groups are synthetic, 
the constructed timeseries is semi-synthetic. Since the correla- 
tions are only statistical in nature, the individual reconstructions 
are realizations drawn randomly from a population with the ob- 
served statistics. This enables Monte-Carlo-type studies on the 
longer term evolution of, e.g. the Sun's open flux, polar fields 
and irradiance variations. 



The paper is structured as follows: in Section 2 we describe 
the datasets and ways to define various cycle parameters such as 
cycle strength, starting time and length of each cycle. In Section 

3 we discuss the correlations between the spatial distribution and 
the propeities of the cycle as determined from Rc- In Section 

4 we use these correlations to reconstruct the butterfly diagram 
from 1700 onwards. 
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Fig. 1. Time evolution of monthly Wolf sunspot number Rz (blue 
line) and group sunspot number /?g (red line). The dotted vertical 
line denotes the year 1874, after which the two sunspot number 
datasets are nearly identical. Note that Rc ends in 1995, after 
which it is assumed to be equal to Rz- 

2. Cycle parameters determined from sunspot 
numbers 

2.1. Sunspot number datasets 

Monthly values of the Wolf sunspot number, Rz, are avail- 
able from 1749 onward and yearly values reach back to 1700. 
Monthly values of Rz before 1749 can be estimated by interpola- 
tion of yearly Rz values. The group sunspot number Rq extends 
further back in time, to 1610, and is again interpolated to obtain 
monthly values whe n the dataset is in complete. As described by 
lUsoskinI (l2008h and lHathawavl (1201 Oh the two datasets have dif- 
ferent definitions and depend on different combinations of solar 
observations. 

Figure [1] shows the two sunspot numbers over the period 
from 1700 to 2010. After the 1870s they are nearly identical. 
Since this covers the period of more detailed RGO sunspot data, 
the two data sets are almost equivalent for use in determining the 
empirical correlations with the RGO data. We have (arbitrarily) 
chosen to use Rc. Differences however are to be expected in the 
reconstructed sunspot group data prior to 1874, as will be seen 
in Section 4. 

2.2. Cycle parameters 

On the basis of the Rc data we define three parameters for each 
cycle. Two of these parameters concern the strength of a cycle. 
The first, S„, is the maximum of the 12 month running mean of 
Rg- The second, 5„, is the sum of Rc over the cycle. 

We also need information as to the timing of the cy- 
cle, and for this we u se the time of the solar minima, 
(iHarvev & Whitelll999h . which we take from the NGDC web- 
site. These times and cycle strengths are listed in Table [1] 

3. Characteristics of sunspot group emergence 
derived from the RGO sunspot data 

In this section we discuss the empirical relationships between 
the strength of the cycle given in Table 1 and the latitudes, lon- 
gitudes and areas of sunspot groups as recorded in the RGO 
dataset. The RGO records cover the period from 1874 to 1976 
(cycles 12 to 20). The sunspot groups are considered at the times 
of their maximum reported area. 

Correlations between the strength of the cycle and the tilt an- 
gle of sunsp ot groups rely on the MW O and Kodaikanal data as 
discussed in lDasi-Espuig et al] ( l2010b . Some properties, such as 
the average latitude at which sunspots emerge, vary throughout 



Table 1. Parameters for solar cycles -4 to 23 (from 1700 on- 
ward) derived from the group sunspot number Rc. 
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92 


156 


23 


19964 


81 


119 



the solar cycle. In this case it is important to consider the data 
from different cycles at the same phase. We then look for cor- 
relations at a fixed phase through the cycle, where the cycle is 
taken to begin and end at adjacent activity minima. 

3.1. Latitude distribution 

The latitude distribution of sunspots is the clearest example of 
where it is necessary to consider the phase during the cycle: early 
in the cycle sunspots appear at higher latitudes than later in the 
cycle. 

We break the time between adjacent minima into 30 equal 
phases. For each cycle, n, we can then calculate the mean lati- 
tude, A'^, averaged over the /* phase bin. Figure |2] shows /IJ, for 
cycles 12 to 20 as a function of the phase of the cycle. Because 
the cycles partially overlap, the first 3 phase bins near the start 
and the last 3 near the end of a cycle show a mixture of spots 
from adjacent cycles. This mixture is of lower-latitude spots 
from the end of a cycle and higher-latitude spots from the begin- 
ning of the subsequent cycle. The average latitude during these 
initial and final phases is thus difficult to interpret. 

J 20 

The phase average A' over all cycles defined as - ^ /Ij, is 

n=12 

well fit by a second degree polynomial: 

a' = 26.4 - 34.2(//30) + 16.1(//30)^ (1) 

over the range 4 < ; < 27. The rms difference between the fit 

and the mean latitudes is 0.3^ 

As previously reported (iLi et all l2003l ISolanki et all l2008h . 
there is a strong correlation between the strength of the cycle and 
the average latitudes of emergence. To evaluate this correlation 
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Fig. 2. Average latitude of sunspot groups at different cycle 
phases. Colors indicate different cycles (cycles 12-20). The 
thick red curve shows the polynomial fit given by Equation ([1} to 
all cycles. The two vertical dotted lines indicate the times when 
the overlapping of cycles strongly affects the average latitudes. 
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Fig. 3. Correlation between cycle averaged latitudes /i„ and cycle 
strength defined by the total sunspot number (S„, left panel) and 
the maximum sunspot number (S„, right panel), respectively. 



we calculated the mean latitude of emergence as 



1 

" ^ 24 ^ 

1=4 



(2) 



Figure [3] shows the relation between the average latitudes A„ and 
cycle strengths defined by S„ (left panel) and S„ (right panel). In 
both cases, a statistically significant (/9-value < 0.05) correlation 
coefficient higher than 0.9 is found. The correlations between S,, 
and An (r=0.94) and between §„ and A„ (r=0.92) are similar We 
hereafter focus on S„. The linear fit for the A„ is given by 



A„ = 12.2 + 0.0225,,. 



(3) 



This observed correlation can be used with Equation (|2} to 
model the phase dependence of the mean latitude of emergence 
for different cycles: 

a'„ = (26.4 - 34.2(//30) + 16.1(//30)2)(^/a)i2-2o) (4) 

for 1 < / < 30 and where {/l)i2-20 - 14.6° is the average latitude 
of sunspot emergence over all the cycles. The fit to each cycle is 
shown in Figure|4] The mean rms deviation between observation 
and reconstruction, excluding the first and last two years of each 
cycle, is 1.33°. 



3.2. Width of the latitude distribution 

Sunspots emerge over a range of latitudes at any phase of the 
cycle. We consider the standard deviation, cr|j, of the latitudinal 






Fig. 4. Comparison of the mean latitudes of sunspot emergence 
for cycles 12-20 between the observation (black line, monthly 
average) and the fit (red line). The rms deviations (in degrees) 
between observation and reconstruction are given excluding the 
first and the last two years of a cycle. 

distribution during phase / of cycle n. The upper panel in Figure 
|5]shows crjj for cycles « = 12-20. A tighter relationship is found 
if we consider the ratio crj, / /Ij^ which is shown in the lower panel 
of the same figure. 

A second-order polynomial fit 

cr' = (0.14 + 1.05(;/30) - 0.18(i/30f)A' (5) 

matches the data well. In our semi-synthetic reconstructions we 
assume a Gaussian distribution with a half width of crj^ and ex- 
clude points deviating from the mean by more then 2crjj. 



3.3. Longitude distribution 



The emergence longitudes of su nspot groups is 
to be not entirely ra ndom (e.g. 



known 

JjBumba & Howard, ' 



'1961 
3j. 



Bai, 1988; Berdyugina & Usoskinl |2003; Zhang et al., 2001 
iCastenmiller et al ] (ll986h introduced the term 'sunspot nests' to 
describe the tendency for sunspots to appear near where other 
spots. Up to 30% of the sunspot groups have previously been 
found to be associated with such nests. 

Our motivation for considering the longitudes distribution is, 
for example, that the open flux of the Sun during activity max- 
ima is dominated by the e quatorial low order multipoles (see, 
e.g. ICameron et al.L l2010h . The strengths of these multipoles, 
and the dipole in particular, depend on how randomly or sys- 
tematically the sunspots appear in longitude. We therefore be- 
gin by considering the equatorial dipole moment of an indi- 
vidual sunspot group. The first step is to convert the sunspot 
area . A, in the RGO dataset into magn etic flux. Here we fol- 
low Ivan Ballegooiien et alJ (Il998h and iBaumann et al ] (12004 
and take the total flux of the group to be proportional to the area 
of the active region (sunspot area and plage area) 

Ar = A+414 + 21A-0.0036A2 (6) 

where all values are in //Hem jChapman et aUlT997h . The equa- 
torial dipole moment of the sunspot group is then assumed to 
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Fig. 5. Upper panel: standard deviation crj, of the latitudinal dis- 
tribution of phase-binned latitudes for cycles 12-20. The line 
style is the same as in Figure |2] Lower panel: similar except 
(xjj/AJ, is shown. The thick red curve shows the polynomial fit 
given by Equation Q. 



be proportional to the area of the group, A«, multiplied by the 
separation between the opposing polarities which we take to be 

1 /2 

proportional to . For a sunspot group j in the northern hemi- 
sphere and near the equator, with area Ar^- and central longitude 
(pj, the axis of the equatorial dipole is orientated in the direction 
± 90°. The component of the dipole in the equatorial plane in 

direction is thus proportional to y cos(0 - For a number 
of sunspots groups in the northern hemisphere, the component 
of the resulting dipole moment in the direction is proportional 
to 

^A^(]cos(0-0,). (7) 

north 

Since the sunspot groups in the southern hemisphere have the 
opposite polarity orientation, the corresponding component of 
the dipole moment is proportional to 

- 2a^(]cos(<^-0;). (8) 

south 

Therefore dipole moment of all the spot groups from both hemi- 
spheres is proportional to 

north south 



Clearly, m depends on the sunspots which are included in the 
sum as well as the direction 0. We define m{t, t, (p) to include in 
the sum all spots emerging between times f and t + t. 

We are not able to reproduce the longitudes at which 
sunspots have appeared from 1700 onwards or even those longi- 
tudes where nesting has occurred. Our aim is to obtain statistical 
information about the degree of nesting by measuring the de- 
gree of non-randomness present in the RGO data. We do this by 
creating three copies of the RGO records. In the first copy we re- 
place the observed longitudes with randomly chosen longitudes 
from a uniform distribution. We use the subscript notation Wjan 
for this dataset. The second copy has the longitudes of its spots 
changed so that they all appear at 0° in the northern hemisphere 
and 180° in the southern hemisphere. This choice maximizes the 
equatorial magnetic dipole moment since sunspot groups in op- 
posite hemispheres have opposite polarity orientations in accor- 
dance with Hale's law. We use the subscript notation niord for 
this dataset. The third copy retains the observed longitudes and 
uses the subscript notation mobs. 

For a given r we measure the amount of nonrandomness, 
c(t). We assume that the magnitude of the observed sunspot 
dipole moment 

Mohsit, t) = max{;7Zobs(f, r, 0)) (10) 

"A 

is the magnitude of a linear combination of the random and or- 
dered datasets 

Mc(f, r) = max{c(r);7Zord(f, r) -t- [1 - c(r)]OTi-a„(f, r)) (11) 

and seek the c(t) which minimizes the rms differences. Figure|6] 
shows Mobs, A/ran and Mc for the example t = 6 months. The dif- 
ference between Mobs and Mran during activity maxima reflects 
the amount of nesting, i.e., the nonrandom longitude distribu- 
tion. The similarity of all the curves near the minima is a conse- 
quence of the fact that, when there are few spot groups, they are 
automatically highly ordered. We show the dependence of c on 
T in Figure |2] 

For different studies the appropriate value of t will vary. For 
irradiance studies r = 1 month would seem to be an appropriate 
choice because it is the instantaneous clustering of the sunspots 
which is important. For surface flux transport simulations, r = 
6 months is more relevant as this is approximatley the time it 
takes for the emerging flux to be sheared by differential rotation 
(ISchriiver & ZwaanLl2000[ p. 162). 

3.4. Area distribution 

In this section we consider the area distribution of sunspot 
groups based upon the RGO dataset. As previously stated, we 
consider each group only at the time when it has its maximum 
recorded area. We begin by considering the area distribution of 
the entire dataset. We will then look at the partially related ques- 
tions of its dependence on the phase during the cycle and on the 
latitudes at which the spots appear. 

3.4.1. Area distribution function 

Figure [8] shows the number density function of sunspot group 
areas. The behaviour is approximately a power law below 
300 yuHem with a turno ver to an a lmost log-normal distribution 
above (see also Zhang et al.Ll2010l) . There is a large range (from 
60 to 300 yuHem) where both functional forms are good approx- 
imations to the data. The fits for the two sections of the curves 
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Fig. 6. Time evolution of the proxy for the equatorial dipole mo- 
ment, M, during 1874-1976 for t - 6 months. The random (blue 
curve), observed (black curve) and combined (red curve) models 
for the longitude distribution of sunspot emergence are shown. 
The value of c = 0.15 corresponds to t = 6 months. 



Fig. 8. Number density function of the group areas for each cycle 
of RGO area dataset. The two red dashed curves show the fits 
from Equations (fTSl i and ( fTlT l. 
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Fig. 7. Degree of non-randomness, c, as a function of the pe- 
riod over which we take the sunspots, r. Error bars represent the 
standard deviation resulting from 20 sets of random longitudes 
to obtain M^n- 
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Fig. 9. Mean sunspot group area for cycles 12-20. The red curve 
is the fit given by Equation ( fT4l i. 



3.4.2. Cycle phase dependence of area distribution 

We next consider dependence on the cycle phase and latitude of 
the area distribution. For the phase-of-cycle dependence we use 
the same type of analysis as in Section [TTI Figure |9] shows the 
area distribution for different cycles as a function of the phase 
1 < i < 30. The average value over all cycles can be fitted by the 
second degree polynomial 



«(A) = 0.003 exp[ J— (In A - \n4-5f] for A > 60uHem. 

2 In 3 

(13) 

The differences between the two fits are mainly in the 
tails, which is a partial explanation of why both log- 
normal and power-la w distribution s have been reported in 
the p a st (see, e.g. Boedan et all 1988L Harvev & Zwaar , 
19931: iBaumann & SolankiL l2005t iHarvev & ZwaanL 11993 ; 
Schriiver & Harvevi Il994b~ We also comment that we are here 
considering the sunspot group areas from the RGO record, which 
includes both the umbral and penumbral area but excludes the 
area of the plage. This also partially explains why our results 
can differ from those of previous authors. 



Ai = 115 -1- 396(i730) - 426(i/30)^ 



(14) 



There is also a (possibly related) dependence of the areas on 
latitude. Figure[TO]shows the number density function of sunspot 
group areas for 5 degree binned latitudes. Even after averaging 
the data in this way there is still some scatter apparent in the data. 
The relative scatter can, which can be judged from the latitude- 
to-latitude variation in the plot, increases with area for each lat- 
itude bin. This is because there are relatively few large sunspot 
groups. The figure also indicates that large sunspots rarely occur 
at low (< 5) latitudes. This partly reflects the phase dependence 
of the area distribution. 
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Fig. 10. Number density function for 5 degree-binned latitudes. 
3.5. Tilt angle distribution 

Joy's law dHale et al.Lll919l) states that the line joining the cen- 
tres of the positive and negative polarities of sunspot groups 
is systematically tilted with respec t to the East-West direction. 
Recently, Dasi-E spuig et al.l (1201 Ol) found that the tilt angles of 
sunspot groups, obtained from the Mount Wilson Observatory 
and from Kodaikanal observations, show a cycle-to-cycle vari- 
ation. They further showed that the average tilt angle is nega- 
tively correlated with the strength of the cycle, i.e. the tilt an- 
gles tend to be smaller for stronger cycles. Incorporating the 
cycle-dependent tilt angles of sunspot groups in a surface flux 
transport model, ICameron et al.l (1201 Ol) reproduced the empiri- 
cally derived time evolution of the solar open magnetic flux and 
the reversal times of the polar fields from 1913 to 1986 (cycles 
15-21), here we consider only the time period covered by the 
RGO data and hence omit cycle 21 from our analysis. The pre- 
cise dependence of the tilt angle on latitude is uncertain and it 
seems that a square root profile matches the data somewhat bet- 
ter than the usually assumed linear relationship. In this paper, 
as in CJSSIO, we consider the square root profile a,, - T„ VPI, 
where Qr„ is the average tilt angle and T„ is the constant of pro- 
portionality for cycle n. 

For each cycl e we d eter mined T„ in a s imilar way as in 
iDasi-Espuig et all ( 1201 Ol) and lCameron et al.l ( |2010|) . Figure [TT] 
shows the correlation between cycle strength S„ and T„. The cor- 
relation coeflicient for S„ is 0.81, slightly higher than that of S„ 
which is 0.78. Under either definition, the cycle strength and T„ 
are significantly correlated {p < 0.05). The linear fit between S„ 
and T„ is 

r„ = 1.73 -0.00355 „ (15) 



3.6. Number of sunspot groups as a function of time 

Our aim in this paper is to construct semi-synthetic sunspot 
group records, based on Re and Rz, with similar statistics to 
those of the RGO data. We have used Re to determine strength 
of each cycle and have found correlations which allow us to 
construct synthetic latitudes, longitudes, areas and tilt angles for 
each spot group. We here determine how many sunspot groups 
should appear each month to make the semi-synthetic record 
have similar statstics as the RGO dataset. For the period covered 
by the RGO records, the monthly number should be approxi- 
mately the same as the number of groups in the RGO dataset, 
Nsc- We have found that the fit Rc 1 2.1, shown in Figure [T2l 
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Fig. 11. The relationship between cycle strength S „ and T„. 
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Fig. 12. A comparison of the number of sunspot groups appear- 
ing each month in the RGO data, Nsc^ (black curve) and fit based 
upon Rc (red dashed curve). 



matches the data well. We use this fit to reconstruct the number 
of sunspot groups appearing each month from 1700 onwards. 



4. Reconstruction of sunspot group emergence 

Our goal is to reconstruct sunspot group emergence back to 1700 
based solely on sunspot numbers. For that we use the derived 
relationships between sunspot groups (mean latitude, latitude 
width, longitude distribution, area distribution and tilt angles) 
and activity indices (sunspot numbers, strength, and phase of the 
cycles) to estimate the spatial and temporal properties of emerg- 
ing sunspot groups. Since the relationships we have derived are 
only in terms of correlations, our models have a random compo- 
nent. For the latitudes and areas we draw from random popula- 
tions which have the relevant distributions set out in the previous 
sections. 

Implicit in such a reconstruction is the assumption that the 
dynamo has operated in a similar way from 1700 onwards. 
The very limited records of observations during the earlier 
part of the 18th century indicate that some of the early cy- 
cles might be anomalous in having stronger activity near the 
equator than those of the better observed later cycles (e.g., see 
■ Ribes & N esme-Ribes, 1993; ^t, 2QQ9). This could indicate 
that the dynamo was operating in a not purely dipole mode dur- 
ing this period. 
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Fig. 13. A comparison of observed (upper panel) and semi- 
synthetic (lower panel) butterfly diagrams for the years 1874- 
2010. 
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Fig. 14. Comparison of butterfly diagrams from observation 
(above the horizontal dashed lines) and reconstruction (below 
the dashed lines) for the weakest cycle 14 (upper panel) and 
the strongest cycle 19 (lower panel), both for the northern hemi- 
sphere. The area of the sunspot groups is indicated by the colors 
and sizes of circles. 



4.1. A semi-synthetic butterfly diagram covering 1874-2010 

We first present an example semi-synthetic butterfly diagram for 
the period from the start of the RGO records to 2010. This allows 
us to directly compare the semi-synthetic and observed butterfly 
diagrams in Figure[T3] As expected, the two diagrams have simi- 
lar appearances. A more detailed comparison of the weakest and 
strongest cycles is shown in Figure [14] Again the observed and 
semi-synthetic butterfly wings look similar This validates the 
use of the semi-synthetic reconstruction for periods when we 
only have the sunspot numbers. 



4.2. Comparison of reconstructions using Rq and Rz during 
1700-1874 

The semi-synthetic model shown in Figures[T3]and[T4]was based 
on the group sunspot number R^. Prior to 1874 Rz and Re 





Fig. 15. Semi-synthetic butterfly diagram for the years 1700- 
1874 using Rc (upper panel) and Rz (lower panel). 
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Fig. 16. Mean sunspot latitudes for the years 1700-1874 recon- 
structed with Rz (blue curves) and Rc (red curves). 



have substantial diff'erences which affect the reconstructed but- 
terfly diagrams. Figure [15] shows the reconstructed butterfly di- 
agram during 1700-1874 with Rc and Rz, respectively. It will 
be very interesting to compare both semi-synthetic butterfly di - 
agrams with those being obtained bv lArlt & Abdolvandl(l2010l) . 
We comment that there is no reason emerging from this study to 
prefer one data set over the other 

To give another indication of the differences in the recon- 
structions based on Rc and Rz Figure[T6]shows the reconstructed 
mean latitudes during 1700-1874. The different cycle strengths 
derived from the two sets of sunspot numbers produce small dif- 
ferences which differ in strength from cycle to cycle. The extent 
to which these differences affect the results of surface flux trans- 
port simulations and the open flux calculated therefrom will be 
investigated in Paper II. 



5. Conclusions 

Using the group sunspot number Rc and RGO, MWO and 
Kodaikanal data sets, we studied the phase dependence and cycle 
dependence of latitude, area and tilt angle distribution properties 
of sunspot group emergence. The main correlations found are: 

1 . The mean latitude at which sunspots emerge can be mod- 
eled using a second order polynomial of cycle phase. 

2. Strong cycles have a higher mean latitude for sunspot 
emergence (Figure [3} . 
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3. The ratio of the latitudinal range over which sunspot 
groups emerge and the average latitude of emergence varies as a 
function of cycle phase (Figure |5j. 

4. The distribution of sunspot areas is similar for all cycles 
(Figure |8]l. 

5. The size distribution is a power-law for small sunspots and 
obeys a log-normal profile for large sunspots (Figure |8). 

6. During cycle maxima sunspots are, in the mean, larger 
(Figure |9]l. 

7. Sunspot nests are important, especially during cycle max- 
imum phases (Figure 6). 

We have modeled and used the correlations to construct 
semi-synthetic butterfly diagrams extending back to 1700. This 
reconstruction will be useful in modeling the large-scale solar 
magnetic field over this period. 

References 

Arlt, R. 2009, Sol. Phys., 255, 143 

Arlt, R. & Abdolvand, A. 2010, ArXiv e-prints 1010.3131 
Bai, T. 1988, ApJ, 328, 860 

Baumann, I., Schmitt, D., Schussler, M., & Solanki. S. K. 2004, A&A, 426, 1075 

Baumann, I. & Solanki, S. K. 2005, A&A, 443, 1061 

Berdyugina, S. V. & Usoskin, I. G. 2003, A&A, 405, 1121 

Bogdan, T. J., Oilman, R A., Lerche, I., & Howard, R. 1988, ApJ, 327, 451 

Bumba, V. & Howard, R. 1965, ApJ, 141, 1502 

Cameron, R. H., Jiang, J., Schmitt, D., & Schussler, M. 2010, ApJ, 719, 264 
Castenmiller, M. J. M., Zwaan, C, & van der Zalm, E. B. J. 1986, Sol. Phys., 
105, 237 

Chapman, G. A., Cookson, A. M., & Dobias, J. J. 1997, ApJ, 482, 541 
Charbonneau, P. 2010, Living Rev. in Solar Phys., 7 

Crouch, A. D., Charbonneau, P, Beaubien, G., & Paquin-Ricard, D. 2008, ApJ, 
677, 723 

Dasi-Espuig, M., Solanki, S. K., Krivova, N. A., Cameron, R., & Peiiuela, T. 

2010, A&A, 518, A7 
Devore, C. R., Boris, J. R, Young, Jr, T. R., Sheeley, N. R., & Harvey, K. L. 

1985, Australian J. Phys., 38, 999 
Hale, G. E., Ellerman, R, Nicholson. S. B., & Joy, A. H. 1919, ApJ, 49, 153 
Harvey, K. L. & White, 0. R. 1999, J. Geophys. Res., 104, 19759 
Harvey, K. L. & Zwaan, C. 1993, Sol. Phys., 148, 85 
Hathaway, D. H. 2010, Living Rev. Solar Phys., 7, 1 
Howard, R., Gilman, P I., & Gilman, P A. 1984, ApJ, 283, 373 
Howard, R. P., Gupta, S. S., & Sivaraman, K. R. 1999, Sol. Phys., 186, 25 
Hoyt, D. V. & Schatten, K. H. 1998, Sol. Phys., 179, 189 
Ivanov, V. G. & Miletsky, E. V. 2011, Sol. Phys., 268, 231 
Li, K. J., Wang, J. X., Zhan, L. S., et al. 2003, Sol. Phys., 215, 99 
Ribes, J. C. & Nesme-Ribes, E. 1993, A&A, 276, 549 
Schrijver, C. J. & Harvey, K. L. 1994, Sol. Phys., 150, 1 

Schrijver, C. J. & Zwaan, C. 2000, Solar and stellar magnetic activity, Cambridge 

Astrophysics Series, 34 (Cambridge, UK: Cambridge University Press) 
Sheeley, Jr, N. R.. Devore, C. R., & Boris, J. P 1985, Sol. Phys., 98, 219 
Sivarainan, K. R., Gupta, S. S., & Howard, R. R 1993, Sol. Phys., 146, 27 
Solanki, S. K., Wenzler. T., & Schmitt, D. 2008, A&A, 483, 623 
Usoskin, L G. 2008, Living Rev. Solar Phys., 5, 3 

van Ballegooijen, A. A., Cartledge, N. P, & Priest, E. R. 1998, ApJ, 501, 866 
Wang, Y., Nash, A. G.. & Sheeley, Jr, N. R. 1989, Science, 245, 712 
Wolf, R. 1861, MNRAS, 21, 77 
Zhang, J., Wang, Y, & Liu, Y. 2010, ApJ, 723, 1006 

Zhang, L. Y, Wang, H. N., Du, Z. L., Cui, Y. M., & He, H. 2007, A&A, 471, 
711 

Zhao, X. & Hoeksema, J. T. 1995, Space Sci. Rev., 72, 189 



8 



